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@ Isolation of full-length cDNA clones and capped mRNA. 



0 A protein comprising at least a first functional 
site having tiie ability to bind the cap structure of 
mRNA and a second functional site having the ability 
to bind a solid support matrix in such a manner as to 
allow the first functional site to be immobilized and 
still remain functionally accessible to interact with 
^the cap structure of mRNA. Also within the scope of 
-fa the present invention is a method for generating a 
5-cDNA library mostly containing full-length cDNAs. 
05 The method comprises the incubation of a mixture 
jY) comprising mRNA:cDNA hybrids with 1) a single 
rs strand RNA specific nuclease and 2) the above- 
rO mentioned- protein. The resulting mixture is then 
Q passed through a column comprising a support ma- 
trix having the ability to bind the second functional 
Mjsite of the above-mentioned protein in order to se- 
lectively bind complete mRNA:cDNA hybrids. The 
mRNA:cDNA hybrids are then competitively eiuted 



with a cap analog and full-length cDNA strands are 
separated and recovered. The present invention also 
includes a method for purifying capped mRNA using 
the above-mentioned protein. The process com- 
prises the incubation of a mixture containing mRNA 
with the above-mentioned protein, passing the result- 
ing mixture through a column comprising the support 
matrix having the ability to bind to the second func- 
tional site of the above-mentioned protein in order to 
selectively bind capped mRNAs, and competitively 
eluting the capped naRNAs with a cap analog. 
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Isolation of full-length cC 

BACKGROUND OF THE INVENTION 

Complementary DNA contains the information 
coding for the synthesis of proteins. The ability 
generate complementary DNA (cDNA) libraries is 
one of the most fundamental procedures in con- 
temporary molecular biology. Research involving 
the use of cDNA libraries has already led to signifi- 
cant breakthroughs in our understanding of cancer. 
AIDS and numerous other medical concerns. Con- 
sequently, there is a rapidly expanding commercial 
interest in this procedure because of its enormous 
current and future potential applicability. For exam- 
ple, a growing number of conripanies are marketing 
"ready made" cDNA libraries or kits which simplify 
the task of preparing a cDNA library. 

While the procedures for generating cDNA li- 
braries are being continuously modified and im- 
proved, there are serious drawbacks in the current 
methods that have not been adequately addressed. 
As a result, cDNA cloning is generally inefficient, 
making it both cumbersome and most unfortunately 
very time consuming. 

In standard methods currently used for the 
preparing of cDNA libraries, the mRNA in the cell 
is isolated by virtue of the presence of a 
polyadenylated tail present at its 3 end which 
binds to a- resin specific for this structure (oligo T- 
chromatography). The purified mRNA is then 
copied into cDNA using the. enzyme, reverse tran- 
scriptase, which starts at the 3' end of the mRNA 
and proceeds towards the 5 end. Second strand 
synthesis is then performed. Linkers are added to 
the ends of the double stranded cDNA to allow for 
its packaging Into virus or cloning into plasmids. At 
this stage, it is in a form that can be propagated, 
the sum of which is termed the cDNA library. 

Unfortunately, the major problem with the ac- 
tual technology is that the majority of the cDNAs 
present in any given library are not full-length be- 
cause the reverse transcriptase enzyme in the ma- 
jority of cases does not make a complete copy of 
the mRNA. Obviously, this creates serious prob- 
lems, especially if one takes into account the fact 
that the efficiency of copying is inversely propor- 
tional to the length of the mRNA. This results in the 
majority of the genetic information in a cDNA li- 
brary having an overabundance of incomplete 
pieces. 

Hence, an incomplete or non full-length cDNA 
usually does not have the entire genetic blueprint 
required to make a functional protein and is there- 
fore of limited scientific value. Usually, investigators 
must perform many rounds of isolation (screenings) 
and construct a "full-length" cDNA from the accu- 



A clones and capped mRNA. 

mulated pieces. Consequently, valuable time and 
scientific resources are lost. Obviously, the prob- 
lem becomes even more acute when long cDNAs 
are sought. Additionally, some fragments of the 

5 desired cDNAs might be so underrepresented in 
the library that it may be impractical to identify and 
isolate all the required segments. 

Furthermore, in cDNA libraries produced by 
conventional methods, there is dismal under-repre- 

;o sentation of sequences close to .the 5 end of 
mRNAs since the reverse transcriptase will usually 
"fall off" before reaching these sequences. This is 
unfortunate since there is a growing interest in 
isolating these 5 proximal sequences, in light of 

/5 recent studies pointing to the importance of such 
sequences in regulating gene expression. 

Another problem concerning cDNA synthesis is 
the source and quality of the mRNA used. Using 
present day technology, the mRNA that is used as 

20 a source for cDNA synthesis is purified by its 3 
end polyadenylated tail. However, some mRNAs do 
not possess a 3' end but all mRNAs have a 5 cap 
structure. Consequently, a cDNA library construct- 
ed from this source of mRNA would be more 

25 representative of the total genetic information 
present in the cell. In recent years, unsuccessful 
attempts have been made to develop antibodies 
directed against the cap structure of mRNA. The 
problems usually encountered were related to the 

30 insufficient affinity of the antibodies for the cap. 
This major drawback made it impossible to develop 
isolation protocols for capped mRNAs. 

Therefore, it would be highly desirable to de- 
velop a method that would increase the ability of 

35 scientists to isolate both full-length cDNA clones 
and capped mRNA, 

SUMMARY OF INVENTION 

40 

In accordance with the present invention, there 
Is provided a protein useful for the preparation. of 
cDNA libraries mostly containing full-length cDNA 
clones. The protein can also be used for the isola- 

45 tion of capped mRNA. The protein of the present 
invention is a multifunctional protein comprising at 
least two functional sites. The first functional site 
has the ability to bind the cap structure of mRNA 
and the second functional site has the ability to 

50 bind a solid support matrix in such a manner as to 
allow said first functional site to be immobilized 
and still remain functionally accessible to interact 
with the cap structure of mRNA. Preferably, a pro- 
tein of the present invention is a bifunctional fusion 
protein having one functional site that has the abil- 
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ity to bind the cap structure of mRNA from 
eucaryotic cells and another functional site having 
the ability to bind to a solid support matrix. 

Also within the scope of the present invention 
is a method for generating a cDNA library mostly 
containing full-tength cONAs. This method com- 
prises a first step in which a mixture comprising 
mRNAxDNA hybrids is, incubated with 

1) a single-strand RNA specific nuclease; 

and 

2) a multifunctional protein comprising at 
least a first functional site having the ability to bind 
the cap structure of mRNA and a second functional 
site having the ability to bind a solid support ma- 
trix. 

The mixture is then passed through a column 
comprising a support matrix having the ability to 
bind to the second functional site of the protein in 
order to selectively bind complete mRNA:cDNA 
hybrids to the matrix. The mRNAxDNA hybrids are 
then competitively eluted with a cap analog and the 
full-length cDNA strands are then separated and 
recovered. Preferably, the single-strand RNA spe- 
cific nuclease that is used for incubating the 
mRNAxDNA hybrids mixture is Ti nuclease 
whereas the preferred cap analog is m'GDP. 

Also within the scope of the present invention 
is a method for purifying capped mRNA. This 
method comprises the incubation of a mixture com- 
prising mRNA with a protein having a first func- 
tional site which has the ability to bind the cap 
structure of mRNA and a second functional site 
having the ability to bind a solid support matrix. 
This mixture is then passed through a column 
comprising a support matrix having the ability to 
bind to the second functional site of the protein in 
order to selectively bind capped mRN As to the 
matrix. The capped mRNAs are then competitively 
eluted with a cap analog such as m^GDP and thus 
capped mRNAs are separated and recovered. 

In a preferred embodiment of the present in- 
vention, the protein used for generating both cDNA 
libraries containing full-length cDNAs and pure 
capped mRNAs is a bifunctional protein, preferably 
a fusion protein of the type protein A/elF.4E fusion 
protein. 

Finally, the present invention also includes a 
resin for the purification of proteins having a func- 
tional cap binding site, said resin comprising an 
oxidized cap analog covalently attached to a solid 
support matrix. Also included is a method for the 
preparation of the resin for the purification of pro- 
teins having a functional cap binding site, said 
method comprising; 

- oxidizing a cap-analog to yield a reactive dial- 
dehyde, and; 

- covalently attaching said oxidized cap-analog to a 
solid support matrix. 



Therefore, the product of the present invention 
will allow, through its selective binding of capped 
mRNA, an improvement in the quality of cDNA 
libraries. This, in return, will allow the identification 

5 of important genes that are not part of present day 
cDNA libraries. Furthermore, the product of the 
present invention can be used to purify capped 
mRNAs selectively in a reproducible manner. 

Other advantages of the present invention will 

70 be readily illustrated by referring to the following 
description. 

IN THE DRAWING 

Figure 1 represents the pRIT2T/elF-4E plas- 

mid. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a novel protein 
useful in construction of full-length cDNA libraries 
and the isolation of full-length cDNAs. 

Essentially, the product of the present invention 
has to be at least bifunctional in that it must have 
the ability to bind the cap structure- of mRNA while 
also having the ability to bind a solid support 
matrix so that the cap binding portion of this pro- 
tein can be immobilized and still remain function- 
ally accessible to interact with the mRNA cap 
structure. The resulting product is a multifunctional 
protein that has the^ ability to purify capped 
mRNAs. 

Preferably, the product that will be used in the 
context of the present invention is a genetically 
engineered fusion protein that can bind both cap 
structures of mRNA and a molecule attached to a 
solid support. However, it is to be understood that, 
the product of ihe present invention is not limited to 
fusion proteins but intends to cover all genetically 
engineered multifunctional proteins possessing the 
ability to bind to both the cap structure of mRNA 
and a solid support mate. 

In order to fully appreciate the approach used 
in the context of the present invention, it might be 
useful to consider that one of the intermediate 
steps in cDNA synthesis results in a mRNAxDNA 
hybrid. When this hybrid is obtained, it is neces- 
sary to add an enzyme that specifically degrades 
single-stranded RNA. If the cDNA is complete or 
full-length, it will cover the entire mRNA and pro- 
tect it against degradation. However, if the cDNA is 
not complete, then that portion of the mRNA which 
is not protected will be degraded. This will invari- 
ably lead to the loss of the s' cap structure from 
the remaining mRNAxDNA hybrid. Thus, the spe- 
cific isolation of full-length mRNAxDNA hybrids will 
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occur when using the fusion protein of the present 
invention that can bind cap structures because only 
the full-length mRNAxDNA hybrids will possess a 
5' cap structure. The resulting cONA library will 
then have full-length clones only and represents an 
ideal library for cDNA cloning. 

Cap structure and cap binding protein 



From alt the eucaryotic cellular mRNAs that 
have been analyzed to date, all of these mRNAs 
have a structural modification at their 5 end 
termed the cap structure or "cap" which consists 
of the structure m'GpppX. where X can be any 
nucleotide. Certain proteins or protein portions, 
have the ability to bind the cap structure and are 
termed cap binding proteins (CBPs). Thus, if an 
mRNA has a cap structure, then the cap binding 
protein will specifically bind the cap In a non- 
covalent fashion. The affinity of the protein for the 
cap structure is high, readily allowing the specific 
retention of capped RNAs as opposed to uncapped 
RNAs. 

The product of the present invention requires a 
portion having the ability to bind the cap structure 
of mRNA. 

Preferably, a 24 kOa cap binding protein 
(CBP). which is known as eucaryotic initiation factor 
4E (elF-4E) may be used in the context of the 
present invention. This protein is found in the 
cytoplasm of all eucaryotic cells including animal, 
plant and yeast. However, it Is to be understood 
that any protein or protein portion that can specifi- 
cally bind the cap structure of mRNA can be con- 
sidered as being a useful part of the present inven- 
tion. 



Solid support matrix binding proteins 



The second essential feature of the product of 
the present invention is that it must possess a 
portion having an affinity for molecules that could 
be bound to a solid support matrix. However, the 
product must be attached to the support matrix in 
such a manner as to allow the cap binding portion 
to interact with the cap structure of mRNAs. 

For example, staphylococcal protein A that has 
the ability to bind IgG immunoglobulins could be 
used in combination with a resin that has IgG 
antibodies attached to it. Also, it could be possible 
to use j9-galactosidase in conjunction with an anti-^ 
galactosidase antibody resin. In fact, any protein or 
protein portion that could possibly be linked to a 
solid support matrix could be used In the context of 
the present invention. 

Therefore, although the present invention will 



highlight the use of a fusion protein containing both 
a cap binding protein and a protein having the 
ability to bind to a solid support matrix, it is to be 
understood that the present invention is not limited 
5 to these types of proteins. In fact, any multifunc- 
tional protein possessing the ability to bind both 
cap structures of mRNA and a solid support matrix 
could be useful in the context of the present inven- 
tion. 

10 

Process for the obtention of full-length cDNAs 



Once a mixture containing mRNA:cDNA hy- 

ts brids has been obtained through methods generally 
known to those skilled in the art, it is incubated 
with a single-strand RNA specific nuclease. Prefer- 
ably. Tt nuclease (RNase Tt from Aspergillus 
oryzae). an endonuclease that specifically attacks 

20 the 3' adjacent phosphodiester bound GpN, can be 
used as a- single-stranded RNA specific nuclease. 
The naturally modified m^G part of the cap struc- 
ture wiii not be recognized by this enzyme. The 
use of RNAse Ti for probing single-strand specific 

25 regions Is well documented and widely known to 
those skilled in the art. RNAs T\ will not attack 
RNA that is hybridized to DNA and it is therefore 
well suited for the purposes of the present inven- 
tion. However, It is to be understood that any 

30 single-strand RNA specific nuclease could also be 
used in the context of the present invention 

Thus, if the reverse transcriptase copies the 
entire length of the mRNA. or if it falls short of a 
few nucleotides such that there is. no unhybridlzed 

35 GpN residue in the corresponding mRNA. RNAse 
Ti . which will only digest unpaired GpN residues, 
will not degrade the mRNA and as a result, the cap 
structure will remain covalently bound to the 
mRNA:cDNA hybrid. If, however, cDNA synthesis 

40 was not complete, the single-strand RNA specific 
nuclease will degrade unpaired RNA and remove 
the cap structure from the mRNA:cDNA hybrid. 

Following nuclease treatment, the mRNA:cDNA 
hybrids are incubated with the multifunctional pro- 

45 tein of the present invention. As a result of this 
incubation, only those mRNA:cDNA hybrids that 
have a covalently attached cap structure will bind 
to the protein of the present invention. By applying 
^the mixture to a resin having a strong affinity with a 

50 functional site of the protein of the present Inven- 
tion, all the non-capped containing hybrids, or in- 
complete cDNAs. will wash through. The bound 
full-length capped mRNA:cDNA hybrids will then 
be competitively eluted with a cap analog such as 

55 m^GDP. 

The resulting purified fraction contains only full- 
length or near full-length first strand cDNAs which 
then act as templates for second strand synthesis. 
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The steps for completing the cDNA library are the 
same as those normally used by those skilled in 
the art. Essentially, the present invention lies in the 
fact that a novel step that discards incomplete 
cDNAs and readily selects for only full-length 
cDNAs to be present in the cDNA library has been 
added to standard cDNA preparation procedures. 



Affinity resin for purifying cap binding proteins 



The selective purification of cap binding pro- 
teins or fusion proteins with a functional cap bind- 
ing site is most efficiently accomplished by affinity 
chromatography using cap-analogs covalently at- 
tached to a solid support matrix. Although several 
cap-analog resins have been devised, and one is 
presently available from Pharmacia, a new cap- 
analog resin that is less expensive, very rapid, and 
less demanding to prepare than those previously 
reported forms part of. the present invention. 

The synthesis of the cap-analog resin is per- 
formed in the following manner. A cap-analog, such 
as m^GDP, is oxidized in the presence of periodate 
to yield a reactive dialdehyde. Upon incubation of 
the oxidized cap-analog with adipic-acid 
dihydrazide agarose (Pharmacia) a hydrazone bond 
is formed. The hydrazone bond is further stabilized 
by reductive amination in the presence of sodium 
cyanoborohydride (NaBHaCN). This results in a 
cap-analog covalently attached (through a spacer) 
to a solid support matrix. The binding efficiency is 
approximately 90% of the input cap-analog and the 
resin is stable for months at 4°C. The procedure 
requires minimal steps and all steps are based an 
simple chemical reactions. 

Affinity purification of capped mRNAs 

Independent of its use in constructing full- 
length cDNAs. the protein of the present invention, 
when used in combination with a suitable binding 
resin, can be used to purify capped mRNAs. In 
cDNA synthesis, there are two major advantages of 
purifying mRNA by the cap structure rather than 
using the conventional poly A tail purification. 

First, not all eucaryotic cellular mRNAs have a 
poly A tail at their 3 end whereas all mRNAs 
analyzed to date have a s' cap structure. Con- 
sequently, the source of mRNA purified will be 
more representlve of the entire spectrum present in 
the cell. 

Secondly, by purifying mRNAs by their cap 
structure, it is possible to minimize the percentage 
of degraded mRNA molecules that are normally 
used as substrates for cDNA synthesis. This fea- 
ture is extremely important because one of the 



most variable and important criteria in the genera- 
tion of a good cDNA library is the quality of the 
mRNA that is used. If an mRNA is p=?rtially deg- 
raded, it can still be copied by the reverse tran- 
5 scriptase enzyme as long as there is a 3 poly A 
tail, thereby exacerbating the problem of incom- 
plete cDNA. 

However, if mRNA is purified by its cap struc- 
ture and it is partially degraded (i.e. 3' sequence 

10 and poly A tail are not present), it will not be a 
substrate for oligo(dT) primed reverse transcription. 
Only mRNAs which have a cap and a poly A tail 
simultaneously will be a substrate for cDNA syn- 
thesis. Invariably, only full-length mRNAs satisfy 

;s this criteria and their use will enhance the quality of 
present day cDNA libraries. 

One must bear in mind that the isolation of 
mRNA is not always related to cDNA synthesis. For 
example, the in vitro synthesis of mRNA by using 

20 the SP6 system (Promega-Biotec) is widely used. 
However, the ability to generate capped mRNAs is 
somewhat variable as it pertains to the efficiency of 
capping. Therefore, a mixed population of capped 
and uncapped mRNA is synthesized and this mix- 

25 ture could easily be separated using the system of 
the present invention. 

The following example is introduced in order to 
illustrate rather than limit the scope of the present 
invention 

30 

Example 1 

35 Construction of the protein A/elF-4E fusion protein. 

In order to produce the bifunctional protein 
A/elF-4E fusion protein, the yeast elF-4E gene was 
fused to staphylococcal protein A, by recombinant 
40 DNA technology. The elF-4E gene was placed in 
front of protein A using Pharmacia vector pRIT2T. 
This vector allows for the efficient overproduction 
of a protein A/elF-4E fusion protein. 

45 

Yeast elF-4E gene 

The yeast elF-4E gene was isolated using the 
method described in Altmann et al., Molecular and 

50 Cell Biology 7(1987) p. 998. To create the fusion 
protein, the yeast elF-4E gene was mutated by site 
directed mutagenic in order to obtain a unique 
BamHI restriction site at the translation start codon. 
The use of BamHI and Hindlll enabled the isolation 

55 of the entire coding sequence of elF-4E except for 
the first amino acid which is lost as a result of 
mutagenesis. 
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Protein A 

Staphylococcal protein A has the ability to bind 
IgG immunoglobulins. Protein A was used because 
the binding constant of protein A to IgG is remark- 
ably high thereby minimizing the toss of fusion 
protein from the IgG resin. This feature is important 
because it allows the purification scheme to be 
repeated with the same material, thereby increas- 
ing the cost-effectiveness of the product. Further- 
more. IgG and the resin to which it is covalentty 
bound is rather cheap, effective and easy to pre- 
pare. Finally, a commercially available gene fusion 
vector sold by Pharmacia under the name pRIT2T 
with protein A sequences placed in an appropriate 
location allows for an easy overproduction of pro- 
tein A fusion protein. 



Introduction of elF-4E into pRIT2T and transforma- 
tion of E. coli 7 

The mutated yeast elF-4E gene described 
above is subcloned into KS vector (Pharmacia) into 
Baml-Hindlll site and subsequently cut with Hindlll. 
the ends are Klenow repaired. BamHl linkers are 
then added, and the desired eiF-4E fragment is cut 
with BamHl and isolated using standard proce- 
dures. The pRIT2T vector is then cut with BamHl 
and the mutated eIF-4E gene is then ligated to the 
pRIT2T vector and transformed into E. coliN4a30-l. 
The transformation procedures are tlibse^generally 
used by those skilled in the art. The resulting 
transformed E. coli strain was given the designation 
A-4E. The plasmid containing the desired elF-4E 
fragment was deposited at the American Type Cul- 
ture Collection (ATCC) and given the accession 
number 40522. 



Expression and isolation of the protein A/elF-4E 
fusion protein 

The use of the pRIT2T vector allows for the 
efficient temperature-inducible expression of in- 
tracellular fusion proteins in E. coli. Following the 
manufacturer's (Pharmacia) pTocedure. the trans- 
formed E. coll cells are grown to an O.D.eoo value 
of approldmaTely 1.0 at 3.0°C. The temperature is 
then raised from 30°C to 42°C for 2 hours. The 
culture is then sonicated in a buffer containing a 
mild detergent and centrifuged at low speed spin in 
order to discard cellular debris. The supernatant 
liquid is then centrifuged at high speed in order to 
obtain high yields of the fusion protein. This high 
speed centrifugation step is not part of the proce- 
dure described by the manufacturer and was intro- 
duced in order to enhance production yields. 



The overexpressed protein is then purified to 
homogeneity by passing the £. coli extract over a 
cap analog affinity resin ofThi^ype described 
above such as m^GDP-agarose. Only the fusion 

5 protein binds the cap-analog resin because of its 
affinity for caps and the other contaminating pro- 
teins are removed by washing the affinity resin with 
low salt containing buffer. 

The bound fusion protein is then specifically 

10 eluted with saturating amounts of a cap analog 
such as m^GDP. which competes for cap specific 
binding sites on the fusion protein. The excess 
m^GDP present with the purified fusion protein is 
removed by dialysis to yield the fusion protein that 

T5 can bind cap structures. Approximately 2 to 3 
milligrams of pure fusion protein can be obtained 
for each liter of culture media. The fusion protein 
thus obtained has proven to be stable for several 
months at 4*'C, apart from being easily overproduc- 

20 ed and purified by simple and inexpensive meth- 
ods. 



Solid-support matrix used for immobilization of the 
25 fusion protein. '■ 

In order to immobilize the fusion protein of the 
present invention, it is necessary to use a resin that 
has an IgG antibody attached to it. This allows for 

30 the specific retention of the fusion protein through 
its protein A portion, thereby allowing the elF-4E 
portion to be free to interact with cap mRNAs. 
Resins of that type are presently available com- 
mercially but it was found that the commercially 

35 available resins especially those sold by Pharmacia 
were contaminated with nucleases that degrade 
mRNA. thereby making it impossible to isolate 
good quality mRNA For the purposes of the 
present invention, a resin synthesized using IgG 

40 antibodies obtained from ICN and Affigel-10 resin 
from Bio-Rad has been used. The column has 
been found to be stable for at least several months 
at 4°C. 

45 

Clalnns 

1. A protein comprising at least a first func- 
tional site having the ability to bind the cap struc- 

50 ture of mRNA and a second functional site having 
the ability to bind a solid support matrix in such a 
manner as to allow said first functional site to be 
immobilized and still remain functionally accessible 
to interact with the cap structure of mRNA. 

55 2, The protein of Claim 1, which is a bifunc- 

tional protein. 

3. The protein of Claim i, which is a fusion 
protein. 
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4. The protein of Claim 3, which Is the protein 
A/elF-4E fusion protein. 

5. The protein of Claim 1, wherein the first 
functional site has the ability to bind the cap struc- 
ture of mRNA from eucaryotic cells. 

6. The protein of Claim V, wherein the first 
functional site is the functional site of elF-4E. 

7. The protein of Claim 1, wherein the second 
functional site is the functional site of protein A 
having an affinity for IgG antibodies. 

8. A method for generating a cDNA library 
mostly containing full-length cDNAs. said method 
comprising: - incubating a mixture comprising 
mRNAxDNA hybrids with 

1) a single-strand RNA specific nuclease 

2) a protein comprising at least a first func- 
tional site having the ability to bind the cap struc- 
ture of mRNA and a second functional site having 
the ability to bind a support matrix in such a 
manner as to allow said first functional site to be 
immobilized and still remain functionally accessible 
to interact with the cap structure of mRNA. - pass- 
ing said mixture through a column comprising a 
support matrix having the ability to bind to said 
second functional site of said protein, thereby se- 
lectively binding complete mRNAxDNA hybrids to 
said matrix; 

- eluting said complete mRNAxDNA hybrids with a 
cap analog: and 

- separating and recovering full length cDNA 
strands. 

9. The method of Claim 8, wherein said single- 
strand ftNA specific nuclease is Ti nuclease. 

10. The method of Claim 8, wherein said pro- 
tein is the protein A/elF-4E fusion protein. 

11. The method of Claim 8, wherein said cap 
analog is m'GDP. 

12. A method for purifying capped mRNA 
which comprises: 

- incubating a mixture containing mRNA with a 
protein comprising at least a first functional site 
having the ability to bind the cap structure of 
mRNA and a second functional site having the 
ability to bind a solid support matrix in such a 
manner as to allow said first functional site to be 
immobilized and still remain functionally accessible 
to interact with the cap structure of mRNA; 

- incubating said mixture with a support matrix 
having the ability to bind to said second functional 
site of said protein, thereby selectively binding 
capped mRNAs to said matrix: 

competitively eluting said capped mRNAs with a 
cap analog; and 

- separating and recovering capped 'mRNAs. 

13. The method of Claim 12, wherein said 
protein is the protein A/elF-4E fusion protein. 

14. The method of Claim 12, wherein said cap 
analog is m^GDP. 



15. A plasmid having the identifying character- 
istics of ATCC number 40522. 

16. A resin for the purification of proteins hav- 
ing a functional cap binding site, said resin com- 

5 prising an oxidized cap analog covalently attached 
to a solid support matrix. 

17. A method for the preparation of the resin of 
Claim 16, said method comprising: 

- oxidizing a cap-analog to yield a reactive dial- 
10 dehyde, and; 

- covalently attaching said oxidized cap-analog to a 
solid support matrix. 
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© A protein comprising, at least a first functiona 
sue having the ability to bind the cap structure of 
mRNA and a second functional site having the ability 
to bind a solid support matrix in such a manner as to 
allow the first functional site to be immobilized and 
still remain functionally accessible to interact with 
the cap structure of mRNA. Also within the scope of 
the present invention is a method for generating a 
cDNA library mostly containing full-length cDNAs. 
The method comprises the incubation of a mixture 
comprising mRNA:cDNA hybrids with 1) a single 
strand RNA specific nuclease and 2) the above- 
mentioned protein. The resulting mixture is then 
passed through a column comprising a support ma- 
irix having the ability to bind the second functional 



site of the above-mentioned protein in order to se- 
lectively bind complete mRNA.cDNA hybrids. The 
mRNA:cDNA hybrids are then competitively eluted 
with a cap analog and full-length cDNA strands are 
separated and recovered. The present invention also 
includes a method for purifying capped mRNA using 
the above-mentioned protein. The process com- 
prises the incubation of a mixture containing mRNA 
with the above-mentioned protein, passing the result- 
inq mixture through a column comprising the support 
matrix having the ability to bind to the second func- 
tional site of the above-mentioned protein m order o 
selectively bind capped mRNAs. and competitively 
eluting the capped mRNAs with a cap analog. 
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